Insights into Bacterial Genome Composition through Variable Target GC Content Profiling

نویسندگان

  • Scott Mann
  • Jinyan Li
  • Yi-Ping Phoebe Chen
چکیده

This study presents a new computational method for guanine (G) and cytosine (C), or GC, content profiling based on the idea of multiple resolution sampling (MRS). The benefit of our new approach over existing techniques follows from its ability to locate significant regions without prior knowledge of the sequence, nor the features being sought. The use of MRS has provided novel insights into bacterial genome composition. Key findings include those that are related to the core composition of bacterial genomes, to the identification of large genomic islands (in Enterobacterial genomes), and to the identification of surface protein determinants in human pathogenic organisms (e.g., Staphylococcus genomes). We observed that bacterial surface binding proteins maintain abnormal GC content, potentially pointing to a viral origin. This study has demonstrated that GC content holds a high informational worth and hints at many underlying evolutionary processes. For online Supplementary Material, see www.liebertonline.com .

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Data on true tRNA diversity among uncultured and bacterial strains

Complete genome sequences of two uncultured archaea (BX649197 and CR937008) and 10 uncultured bacteria (AC160099, FP245538-FP245540, FP312972, FP312974-75, FP312977, FP312985 and NZ_JPJG01000067) were used for creation of digital data of tRNA. tRNAscan-SE and ENDMEMO GC calculating tools were used for detection of tRNA, drawing their structures and calculation of GC percent. Seven archaeal and ...

متن کامل

The evolution of genomic base composition in bacteria.

Guanine plus cytosine (GC) content ranges broadly among bacterial genomes. In this study, we explore the use of a Brownian-motion model for the evolution of GC content over time. This model assumes that GC content varies over time in a continuous and homogeneous manner. Using this model and a maximum-likelihood approach, we analyzed the evolution of GC content across several bacterial phylogeni...

متن کامل

Evolution of genome base composition and genome size in bacteria

In bacteria and archaea, genome size and guanine–cytosine (GC) content are correlated (Bentley and Parkhill, 2004; Musto et al., 2006; Mitchell, 2007; Suzuki et al., 2008; Guo et al., 2009). These parameters show greater correlation in bacteria (Pearson’s correlation coefficient r = 0.46) than in archaea (r = 0.195) (Nishida, 2012a). The GC content in bacteria varies widely from 13.5% in “Candi...

متن کامل

Origin of an Alternative Genetic Code in the Extremely Small and GC–Rich Genome of a Bacterial Symbiont

The genetic code relates nucleotide sequence to amino acid sequence and is shared across all organisms, with the rare exceptions of lineages in which one or a few codons have acquired novel assignments. Recoding of UGA from stop to tryptophan has evolved independently in certain reduced bacterial genomes, including those of the mycoplasmas and some mitochondria. Small genomes typically exhibit ...

متن کامل

DNA structure constraint is probably a fundamental factor inducing CpG deficiency in bacteria

MOTIVATION It has been speculated that CpG dinucleotide deficiency in genomes is a consequence of DNA methylation. However, this hypothesis does not adequately explain CpG deficiency in bacteria. The hypothesis based on DNA structure constraint as an alternative explanation was therefore examined. RESULTS By comparing real bacterial genomes and Markov artificial genomes in the second order, w...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Journal of computational biology : a journal of computational molecular cell biology

دوره 17 1  شماره 

صفحات  -

تاریخ انتشار 2010